Corpus: spa-uy_web_2015_300K


4.4.1.5 Number of Word-N-grams at Sentence Endings

Number of distinct word N-grams (N = 1 to 5) found at the ends of the first K sentences

      K  # of words  # of bigrams  # of trigrams  # of 4-grams  # of 5-grams
    100          91            94             94            95            97
   1000         822           928            953           961           970
  10000        5987          8449           9375          9690          9766
 100000       31995         65330          85417         94417         96859
1000000       66632        165897         238422        276376        287284
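
The counts in the table above can be approximated with a short script. The following is only a minimal sketch and not the tool that produced this report: it assumes one sentence per string, whitespace tokenization, and that a sentence with fewer than N tokens contributes no N-gram. The function name `ending_ngram_counts` is hypothetical.

```python
def ending_ngram_counts(sentences, max_n=5):
    """Count distinct word N-grams (N = 1..max_n) at sentence endings.

    Assumptions (not taken from the report): tokens are split on
    whitespace, and sentences shorter than N tokens are skipped for that N.
    """
    # One set of distinct ending N-grams per value of N.
    seen = [set() for _ in range(max_n)]
    for sentence in sentences:
        tokens = sentence.split()
        for n in range(1, max_n + 1):
            if len(tokens) >= n:
                # The last n tokens form the sentence-ending N-gram.
                seen[n - 1].add(tuple(tokens[-n:]))
    return [len(ngrams) for ngrams in seen]


# Tiny illustrative input; to mimic one table row, pass the first K sentences.
sample = ["el gato duerme .", "el perro duerme .", "hola ."]
print(ending_ngram_counts(sample))
```

Calling the function on `sentences[:K]` for K = 100, 1000, and so on would yield one row per K. The exact numbers depend on the tokenizer, so they need not match the table.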


Zipf's diagram for sentence endings


Gnuplot diagram
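
A Zipf diagram plots frequency against rank, usually on log-log axes. The data behind such a plot for sentence endings could be derived roughly as below. This is a hedged sketch under the assumption that the diagram is based on the last whitespace-separated token of each sentence; the report does not state which unit was plotted, and the function name `rank_frequency` is hypothetical.

```python
from collections import Counter


def rank_frequency(sentences):
    """Return (rank, frequency) pairs for sentence-final tokens.

    Assumption: the sentence-final token is the last whitespace-separated
    token; empty sentences are ignored.
    """
    counts = Counter()
    for sentence in sentences:
        tokens = sentence.split()
        if tokens:
            counts[tokens[-1]] += 1
    # Rank 1 is the most frequent ending, rank 2 the next, and so on.
    freqs = sorted(counts.values(), reverse=True)
    return list(enumerate(freqs, start=1))


# Each pair can be written as "rank frequency" and plotted on log-log axes.
for rank, freq in rank_frequency(["a b", "c b", "d e", ""]):
    print(rank, freq)
```

Under Zipf's law the points fall close to a straight line with negative slope when both axes are logarithmic.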

Computation time: 34120 msec (2018-06-23 11:42)